Dataset statistics
| Number of variables | 34 |
|---|---|
| Number of observations | 353035 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 91.6 MiB |
| Average record size in memory | 272.0 B |
Variable types
| Numeric | 26 |
|---|---|
| Categorical | 8 |
tripduration is highly overall correlated with dist | High correlation |
dist is highly overall correlated with end_lat and 1 other fields | High correlation |
birthyear is highly overall correlated with usertype and 2 other fields | High correlation |
years_old is highly overall correlated with usertype and 2 other fields | High correlation |
tempmax is highly overall correlated with month and 12 other fields | High correlation |
tempmin is highly overall correlated with month and 10 other fields | High correlation |
temp is highly overall correlated with month and 12 other fields | High correlation |
feelslike is highly overall correlated with month and 12 other fields | High correlation |
precip is highly overall correlated with visibility and 1 other fields | High correlation |
dew is highly overall correlated with month and 11 other fields | High correlation |
humidity is highly overall correlated with tempmax and 9 other fields | High correlation |
snow is highly overall correlated with snowdepth and 2 other fields | High correlation |
snowdepth is highly overall correlated with tempmax and 6 other fields | High correlation |
visibility is highly overall correlated with precip and 5 other fields | High correlation |
solarradiation is highly overall correlated with month and 12 other fields | High correlation |
cloudcover is highly overall correlated with tempmax and 8 other fields | High correlation |
usertype is highly overall correlated with birthyear and 1 other fields | High correlation |
gender is highly overall correlated with birthyear and 1 other fields | High correlation |
month is highly overall correlated with tempmax and 8 other fields | High correlation |
conditions is highly overall correlated with tempmax and 8 other fields | High correlation |
description is highly overall correlated with month and 15 other fields | High correlation |
seasons is highly overall correlated with month and 7 other fields | High correlation |
start_lon is highly overall correlated with start_station_id and 2 other fields | High correlation |
end_lon is highly overall correlated with start_lon and 3 other fields | High correlation |
start_station_id is highly overall correlated with start_lat and 1 other fields | High correlation |
start_lat is highly overall correlated with start_station_id and 2 other fields | High correlation |
end_station_id is highly overall correlated with end_lat and 1 other fields | High correlation |
end_lat is highly overall correlated with start_lat and 3 other fields | High correlation |
windspeed is highly overall correlated with month and 7 other fields | High correlation |
min has 5390 (1.5%) zeros | Zeros |
tempmin has 4161 (1.2%) zeros | Zeros |
precip has 200641 (56.8%) zeros | Zeros |
snow has 345872 (98.0%) zeros | Zeros |
snowdepth has 336604 (95.3%) zeros | Zeros |
Reproduction
| Analysis started | 2023-02-17 11:41:36.929010 |
|---|---|
| Analysis finished | 2023-02-17 11:48:42.839580 |
| Duration | 7 minutes and 5.91 seconds |
| Software version | pandas-profiling vv3.5.0 |
| Download configuration | config.json |
tripduration
Real number (ℝ)
| Distinct | 5749 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 525.93321 |
| Minimum | 61 |
|---|---|
| Maximum | 14395 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 61 |
|---|---|
| 5-th percentile | 131 |
| Q1 | 228 |
| median | 334 |
| Q3 | 548 |
| 95-th percentile | 1507 |
| Maximum | 14395 |
| Range | 14334 |
| Interquartile range (IQR) | 320 |
Descriptive statistics
| Standard deviation | 731.10143 |
|---|---|
| Coefficient of variation (CV) | 1.3901032 |
| Kurtosis | 90.39266 |
| Mean | 525.93321 |
| Median Absolute Deviation (MAD) | 133 |
| Skewness | 7.644811 |
| Sum | 1.8567283 × 108 |
| Variance | 534509.31 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 244 | 1014 | 0.3% |
| 246 | 998 | 0.3% |
| 260 | 984 | 0.3% |
| 233 | 978 | 0.3% |
| 242 | 976 | 0.3% |
| 243 | 971 | 0.3% |
| 266 | 960 | 0.3% |
| 250 | 956 | 0.3% |
| 247 | 956 | 0.3% |
| 263 | 953 | 0.3% |
| Other values (5739) | 343289 |
| Value | Count | Frequency (%) |
| 61 | 72 | |
| 62 | 72 | |
| 63 | 93 | |
| 64 | 82 | |
| 65 | 80 | |
| 66 | 96 | |
| 67 | 112 | |
| 68 | 90 | |
| 69 | 119 | |
| 70 | 110 |
| Value | Count | Frequency (%) |
| 14395 | 2 | |
| 14380 | 1 | |
| 14367 | 1 | |
| 14362 | 1 | |
| 14350 | 1 | |
| 14345 | 1 | |
| 14338 | 1 | |
| 14304 | 1 | |
| 14301 | 1 | |
| 14291 | 1 |
start_station_id
Real number (ℝ)
| Distinct | 59 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3265.0056 |
| Minimum | 3183 |
|---|---|
| Maximum | 3694 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 3183 |
|---|---|
| 5-th percentile | 3183 |
| Q1 | 3192 |
| median | 3205 |
| Q3 | 3272 |
| 95-th percentile | 3639 |
| Maximum | 3694 |
| Range | 511 |
| Interquartile range (IQR) | 80 |
Descriptive statistics
| Standard deviation | 138.43089 |
|---|---|
| Coefficient of variation (CV) | 0.042398363 |
| Kurtosis | 3.3999331 |
| Mean | 3265.0056 |
| Median Absolute Deviation (MAD) | 19 |
| Skewness | 2.2057689 |
| Sum | 1.1526612 × 109 |
| Variance | 19163.112 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3186 | 40830 | 11.6% |
| 3203 | 20842 | 5.9% |
| 3183 | 18936 | 5.4% |
| 3195 | 18149 | 5.1% |
| 3202 | 15194 | 4.3% |
| 3639 | 12088 | 3.4% |
| 3267 | 10515 | 3.0% |
| 3276 | 10330 | 2.9% |
| 3199 | 10055 | 2.8% |
| 3211 | 9437 | 2.7% |
| Other values (49) | 186659 |
| Value | Count | Frequency (%) |
| 3183 | 18936 | |
| 3184 | 8696 | 2.5% |
| 3185 | 8804 | 2.5% |
| 3186 | 40830 | |
| 3187 | 9245 | 2.6% |
| 3188 | 49 | < 0.1% |
| 3189 | 42 | < 0.1% |
| 3190 | 153 | < 0.1% |
| 3191 | 1015 | 0.3% |
| 3192 | 7040 | 2.0% |
| Value | Count | Frequency (%) |
| 3694 | 390 | 0.1% |
| 3681 | 4238 | 1.2% |
| 3679 | 2411 | 0.7% |
| 3678 | 2523 | 0.7% |
| 3677 | 1157 | 0.3% |
| 3640 | 5810 | |
| 3639 | 12088 | |
| 3638 | 7323 | |
| 3483 | 2617 | 0.7% |
| 3481 | 3296 | 0.9% |
start_lat
Real number (ℝ)
| Distinct | 59 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.722726 |
| Minimum | 40.69264 |
|---|---|
| Maximum | 40.748716 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 40.69264 |
|---|---|
| 5-th percentile | 40.712419 |
| Q1 | 40.718211 |
| median | 40.721525 |
| Q3 | 40.727224 |
| 95-th percentile | 40.737604 |
| Maximum | 40.748716 |
| Range | 0.056075979 |
| Interquartile range (IQR) | 0.0090122 |
Descriptive statistics
| Standard deviation | 0.007249213 |
|---|---|
| Coefficient of variation (CV) | 0.00017801394 |
| Kurtosis | 1.3678635 |
| Mean | 40.722726 |
| Median Absolute Deviation (MAD) | 0.0044865796 |
| Skewness | 0.97789544 |
| Sum | 14376548 |
| Variance | 5.2551089 × 10-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.71958612 | 40830 | 11.6% |
| 40.72759597 | 20842 | 5.9% |
| 40.7162469 | 18936 | 5.4% |
| 40.73074263 | 18149 | 5.1% |
| 40.7272235 | 15194 | 4.3% |
| 40.7192517 | 12088 | 3.4% |
| 40.71241882 | 10515 | 3.0% |
| 40.71458404 | 10330 | 2.9% |
| 40.7287448 | 10055 | 2.8% |
| 40.72152515 | 9437 | 2.7% |
| Other values (49) | 186659 |
| Value | Count | Frequency (%) |
| 40.69263997 | 17 | < 0.1% |
| 40.6970299 | 14 | < 0.1% |
| 40.69865054 | 34 | < 0.1% |
| 40.70495752 | 23 | < 0.1% |
| 40.70965083 | 52 | < 0.1% |
| 40.7101087 | 49 | < 0.1% |
| 40.71046702 | 153 | < 0.1% |
| 40.71113 | 390 | 0.1% |
| 40.7111305 | 46 | < 0.1% |
| 40.7112423 | 7040 |
| Value | Count | Frequency (%) |
| 40.74871595 | 2556 | |
| 40.74590997 | 2676 | |
| 40.7443187 | 2066 | 0.6% |
| 40.74267714 | 4420 | |
| 40.737711 | 1167 | 0.3% |
| 40.7376037 | 5377 | |
| 40.73496102 | 2024 | 0.6% |
| 40.73478582 | 2407 | |
| 40.73367 | 5810 | |
| 40.7311689 | 3067 |
start_lon
Real number (ℝ)
| Distinct | 59 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -74.046039 |
| Minimum | -74.096937 |
|---|---|
| Maximum | -74.032108 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 353035 |
| Negative (%) | 100.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | -74.096937 |
|---|---|
| 5-th percentile | -74.067622 |
| Q1 | -74.050444 |
| median | -74.043845 |
| Q3 | -74.038051 |
| 95-th percentile | -74.033459 |
| Maximum | -74.032108 |
| Range | 0.0648284 |
| Interquartile range (IQR) | 0.012392686 |
Descriptive statistics
| Standard deviation | 0.010753324 |
|---|---|
| Coefficient of variation (CV) | -0.00014522484 |
| Kurtosis | 0.47010856 |
| Mean | -74.046039 |
| Median Absolute Deviation (MAD) | 0.0061616919 |
| Skewness | -0.96456733 |
| Sum | -26140844 |
| Variance | 0.00011563399 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -74.04311746 | 40830 | 11.6% |
| -74.04424731 | 20842 | 5.9% |
| -74.0334588 | 18936 | 5.4% |
| -74.06378388 | 18149 | 5.1% |
| -74.0337589 | 15194 | 4.3% |
| -74.034234 | 12088 | 3.4% |
| -74.03852552 | 10515 | 3.0% |
| -74.04281706 | 10330 | 2.9% |
| -74.0321082 | 10055 | 2.8% |
| -74.04630454 | 9437 | 2.7% |
| Other values (49) | 186659 |
| Value | Count | Frequency (%) |
| -74.0969366 | 14 | < 0.1% |
| -74.0887723 | 42 | < 0.1% |
| -74.08801228 | 17 | < 0.1% |
| -74.08593088 | 23 | < 0.1% |
| -74.0858489 | 49 | < 0.1% |
| -74.0836394 | 1015 | 0.3% |
| -74.08207968 | 34 | < 0.1% |
| -74.0789 | 390 | 0.1% |
| -74.0788855 | 46 | < 0.1% |
| -74.07840595 | 2565 |
| Value | Count | Frequency (%) |
| -74.0321082 | 10055 | |
| -74.0334588 | 18936 | |
| -74.0335519 | 8696 | |
| -74.0337589 | 15194 | |
| -74.034234 | 12088 | |
| -74.0354826 | 7323 | 2.1% |
| -74.0364857 | 6423 | 1.8% |
| -74.03768331 | 4238 | 1.2% |
| -74.03805095 | 9245 | |
| -74.03852552 | 10515 |
end_station_id
Real number (ℝ)
| Distinct | 121 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3258.3976 |
| Minimum | 127 |
|---|---|
| Maximum | 3694 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 127 |
|---|---|
| 5-th percentile | 3183 |
| Q1 | 3186 |
| median | 3203 |
| Q3 | 3270 |
| 95-th percentile | 3639 |
| Maximum | 3694 |
| Range | 3567 |
| Interquartile range (IQR) | 84 |
Descriptive statistics
| Standard deviation | 147.20129 |
|---|---|
| Coefficient of variation (CV) | 0.045175974 |
| Kurtosis | 64.396528 |
| Mean | 3258.3976 |
| Median Absolute Deviation (MAD) | 17 |
| Skewness | -1.3715088 |
| Sum | 1.1503284 × 109 |
| Variance | 21668.219 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3186 | 50536 | 14.3% |
| 3183 | 24065 | 6.8% |
| 3203 | 19635 | 5.6% |
| 3202 | 16286 | 4.6% |
| 3195 | 16283 | 4.6% |
| 3639 | 11620 | 3.3% |
| 3276 | 10055 | 2.8% |
| 3199 | 9994 | 2.8% |
| 3211 | 9574 | 2.7% |
| 3185 | 9438 | 2.7% |
| Other values (111) | 175549 |
| Value | Count | Frequency (%) |
| 127 | 1 | < 0.1% |
| 146 | 3 | < 0.1% |
| 157 | 9 | |
| 167 | 1 | < 0.1% |
| 212 | 1 | < 0.1% |
| 254 | 1 | < 0.1% |
| 259 | 1 | < 0.1% |
| 264 | 1 | < 0.1% |
| 276 | 1 | < 0.1% |
| 303 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 3694 | 333 | 0.1% |
| 3681 | 4046 | 1.1% |
| 3679 | 2100 | 0.6% |
| 3678 | 1915 | 0.5% |
| 3677 | 814 | 0.2% |
| 3640 | 5287 | |
| 3639 | 11620 | |
| 3638 | 7348 | |
| 3552 | 1 | < 0.1% |
| 3547 | 3 | < 0.1% |
end_lat
Real number (ℝ)
| Distinct | 121 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 40.722327 |
| Minimum | 40.679331 |
|---|---|
| Maximum | 40.814326 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 40.679331 |
|---|---|
| 5-th percentile | 40.712774 |
| Q1 | 40.717732 |
| median | 40.721124 |
| Q3 | 40.727224 |
| 95-th percentile | 40.734961 |
| Maximum | 40.814326 |
| Range | 0.1349949 |
| Interquartile range (IQR) | 0.009491 |
Descriptive statistics
| Standard deviation | 0.0070856291 |
|---|---|
| Coefficient of variation (CV) | 0.00017399863 |
| Kurtosis | 2.1663154 |
| Mean | 40.722327 |
| Median Absolute Deviation (MAD) | 0.0046025374 |
| Skewness | 1.1086587 |
| Sum | 14376407 |
| Variance | 5.020614 × 10-5 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40.71958612 | 50536 | 14.3% |
| 40.7162469 | 24065 | 6.8% |
| 40.72759597 | 19635 | 5.6% |
| 40.7272235 | 16286 | 4.6% |
| 40.73074263 | 16283 | 4.6% |
| 40.7192517 | 11620 | 3.3% |
| 40.71458404 | 10055 | 2.8% |
| 40.7287448 | 9994 | 2.8% |
| 40.72152515 | 9574 | 2.7% |
| 40.7177325 | 9438 | 2.7% |
| Other values (111) | 175549 |
| Value | Count | Frequency (%) |
| 40.6793307 | 1 | < 0.1% |
| 40.68539567 | 1 | < 0.1% |
| 40.68763155 | 1 | < 0.1% |
| 40.69089272 | 9 | < 0.1% |
| 40.69165183 | 6 | < 0.1% |
| 40.69263997 | 15 | |
| 40.6970299 | 22 | |
| 40.69865054 | 31 | |
| 40.70122128 | 1 | < 0.1% |
| 40.701907 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 40.8143256 | 3 | |
| 40.8067581 | 1 | < 0.1% |
| 40.805973 | 1 | < 0.1% |
| 40.7961535 | 1 | < 0.1% |
| 40.7746671 | 1 | < 0.1% |
| 40.7734066 | 2 | |
| 40.770513 | 1 | < 0.1% |
| 40.768254 | 1 | < 0.1% |
| 40.76590936 | 1 | < 0.1% |
| 40.76370739 | 4 |
end_lon
Real number (ℝ)
| Distinct | 121 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -74.045504 |
| Minimum | -74.096937 |
|---|---|
| Maximum | -73.947821 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 353035 |
| Negative (%) | 100.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | -74.096937 |
|---|---|
| 5-th percentile | -74.066921 |
| Q1 | -74.049968 |
| median | -74.043117 |
| Q3 | -74.037683 |
| 95-th percentile | -74.033459 |
| Maximum | -73.947821 |
| Range | 0.14911515 |
| Interquartile range (IQR) | 0.012284517 |
Descriptive statistics
| Standard deviation | 0.010750348 |
|---|---|
| Coefficient of variation (CV) | -0.0001451857 |
| Kurtosis | 1.0340059 |
| Mean | -74.045504 |
| Median Absolute Deviation (MAD) | 0.0066317636 |
| Skewness | -1.0355317 |
| Sum | -26140654 |
| Variance | 0.00011556998 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| -74.04311746 | 50536 | 14.3% |
| -74.0334588 | 24065 | 6.8% |
| -74.04424731 | 19635 | 5.6% |
| -74.0337589 | 16286 | 4.6% |
| -74.06378388 | 16283 | 4.6% |
| -74.034234 | 11620 | 3.3% |
| -74.04281706 | 10055 | 2.8% |
| -74.0321082 | 9994 | 2.8% |
| -74.04630454 | 9574 | 2.7% |
| -74.043845 | 9438 | 2.7% |
| Other values (111) | 175549 |
| Value | Count | Frequency (%) |
| -74.0969366 | 22 | < 0.1% |
| -74.0887723 | 96 | < 0.1% |
| -74.08801228 | 15 | < 0.1% |
| -74.08593088 | 23 | < 0.1% |
| -74.0858489 | 50 | < 0.1% |
| -74.0836394 | 1295 | |
| -74.08207968 | 31 | < 0.1% |
| -74.0789 | 333 | 0.1% |
| -74.0788855 | 32 | < 0.1% |
| -74.07840595 | 3211 |
| Value | Count | Frequency (%) |
| -73.94782145 | 1 | < 0.1% |
| -73.9590255 | 3 | |
| -73.9607082 | 1 | < 0.1% |
| -73.964928 | 1 | < 0.1% |
| -73.97031366 | 1 | < 0.1% |
| -73.97121214 | 1 | < 0.1% |
| -73.97431458 | 1 | < 0.1% |
| -73.97498696 | 1 | < 0.1% |
| -73.97519523 | 1 | < 0.1% |
| -73.97604882 | 1 | < 0.1% |
bikeid
Real number (ℝ)
| Distinct | 903 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29452.978 |
| Minimum | 14697 |
|---|---|
| Maximum | 35009 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 14697 |
|---|---|
| 5-th percentile | 26183 |
| Q1 | 26315 |
| median | 29493 |
| Q3 | 29679 |
| 95-th percentile | 33638 |
| Maximum | 35009 |
| Range | 20312 |
| Interquartile range (IQR) | 3364 |
Descriptive statistics
| Standard deviation | 2529.8244 |
|---|---|
| Coefficient of variation (CV) | 0.085893671 |
| Kurtosis | 0.83380671 |
| Mean | 29452.978 |
| Median Absolute Deviation (MAD) | 1911 |
| Skewness | -0.12760211 |
| Sum | 1.0397932 × 1010 |
| Variance | 6400011.7 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26155 | 872 | 0.2% |
| 26288 | 852 | 0.2% |
| 29586 | 840 | 0.2% |
| 29598 | 832 | 0.2% |
| 29608 | 831 | 0.2% |
| 29595 | 824 | 0.2% |
| 29669 | 813 | 0.2% |
| 29583 | 810 | 0.2% |
| 29602 | 804 | 0.2% |
| 29662 | 794 | 0.2% |
| Other values (893) | 344763 |
| Value | Count | Frequency (%) |
| 14697 | 11 | < 0.1% |
| 14793 | 18 | < 0.1% |
| 14956 | 5 | < 0.1% |
| 14977 | 49 | |
| 14991 | 43 | < 0.1% |
| 15114 | 47 | < 0.1% |
| 15271 | 25 | < 0.1% |
| 15302 | 118 | |
| 15444 | 15 | < 0.1% |
| 15582 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 35009 | 37 | < 0.1% |
| 34791 | 10 | < 0.1% |
| 34676 | 10 | < 0.1% |
| 34664 | 1 | < 0.1% |
| 34354 | 31 | < 0.1% |
| 34155 | 1 | < 0.1% |
| 33840 | 164 | |
| 33814 | 155 | |
| 33781 | 160 | |
| 33744 | 83 |
usertype
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
| Subscriber | |
|---|---|
| Customer | 21766 |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.8766921 |
| Min length | 8 |
Characters and Unicode
| Total characters | 3486818 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Subscriber |
|---|---|
| 2nd row | Subscriber |
| 3rd row | Subscriber |
| 4th row | Subscriber |
| 5th row | Subscriber |
Common Values
| Value | Count | Frequency (%) |
| Subscriber | 331269 | |
| Customer | 21766 | 6.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| subscriber | 331269 | |
| customer | 21766 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 684304 | |
| b | 662538 | |
| u | 353035 | |
| s | 353035 | |
| e | 353035 | |
| S | 331269 | |
| c | 331269 | |
| i | 331269 | |
| C | 21766 | 0.6% |
| t | 21766 | 0.6% |
| Other values (2) | 43532 | 1.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3133783 | |
| Uppercase Letter | 353035 | 10.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 684304 | |
| b | 662538 | |
| u | 353035 | |
| s | 353035 | |
| e | 353035 | |
| c | 331269 | |
| i | 331269 | |
| t | 21766 | 0.7% |
| o | 21766 | 0.7% |
| m | 21766 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 331269 | |
| C | 21766 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3486818 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 684304 | |
| b | 662538 | |
| u | 353035 | |
| s | 353035 | |
| e | 353035 | |
| S | 331269 | |
| c | 331269 | |
| i | 331269 | |
| C | 21766 | 0.6% |
| t | 21766 | 0.6% |
| Other values (2) | 43532 | 1.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3486818 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 684304 | |
| b | 662538 | |
| u | 353035 | |
| s | 353035 | |
| e | 353035 | |
| S | 331269 | |
| c | 331269 | |
| i | 331269 | |
| C | 21766 | 0.6% |
| t | 21766 | 0.6% |
| Other values (2) | 43532 | 1.2% |
gender
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
| male | |
|---|---|
| female | |
| unknown | 20963 |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.6039939 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1625371 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | male |
|---|---|
| 2nd row | female |
| 3rd row | male |
| 4th row | male |
| 5th row | male |
Common Values
| Value | Count | Frequency (%) |
| male | 256901 | |
| female | 75171 | 21.3% |
| unknown | 20963 | 5.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| male | 256901 | |
| female | 75171 | 21.3% |
| unknown | 20963 | 5.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 407243 | |
| m | 332072 | |
| a | 332072 | |
| l | 332072 | |
| f | 75171 | 4.6% |
| n | 62889 | 3.9% |
| u | 20963 | 1.3% |
| k | 20963 | 1.3% |
| o | 20963 | 1.3% |
| w | 20963 | 1.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1625371 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 407243 | |
| m | 332072 | |
| a | 332072 | |
| l | 332072 | |
| f | 75171 | 4.6% |
| n | 62889 | 3.9% |
| u | 20963 | 1.3% |
| k | 20963 | 1.3% |
| o | 20963 | 1.3% |
| w | 20963 | 1.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1625371 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 407243 | |
| m | 332072 | |
| a | 332072 | |
| l | 332072 | |
| f | 75171 | 4.6% |
| n | 62889 | 3.9% |
| u | 20963 | 1.3% |
| k | 20963 | 1.3% |
| o | 20963 | 1.3% |
| w | 20963 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1625371 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 407243 | |
| m | 332072 | |
| a | 332072 | |
| l | 332072 | |
| f | 75171 | 4.6% |
| n | 62889 | 3.9% |
| u | 20963 | 1.3% |
| k | 20963 | 1.3% |
| o | 20963 | 1.3% |
| w | 20963 | 1.3% |
dist
Real number (ℝ)
| Distinct | 2663 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.78422928 |
| Minimum | 0 |
|---|---|
| Maximum | 9.1122582 |
| Zeros | 2325 |
| Zeros (%) | 0.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.15866545 |
| Q1 | 0.43349609 |
| median | 0.65904833 |
| Q3 | 0.96379915 |
| 95-th percentile | 1.8366588 |
| Maximum | 9.1122582 |
| Range | 9.1122582 |
| Interquartile range (IQR) | 0.53030306 |
Descriptive statistics
| Standard deviation | 0.542201 |
|---|---|
| Coefficient of variation (CV) | 0.69138071 |
| Kurtosis | 5.2340002 |
| Mean | 0.78422928 |
| Median Absolute Deviation (MAD) | 0.24335259 |
| Skewness | 1.8033726 |
| Sum | 276860.38 |
| Variance | 0.29398192 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.6590483332 | 7869 | 2.2% |
| 0.6590574045 | 5870 | 1.7% |
| 0.4156981062 | 4599 | 1.3% |
| 0.7841604255 | 4263 | 1.2% |
| 0.4396170481 | 4131 | 1.2% |
| 0.7508857537 | 3984 | 1.1% |
| 0.415695745 | 3318 | 0.9% |
| 0.6185235707 | 3279 | 0.9% |
| 0.4188632139 | 3200 | 0.9% |
| 0.34202925 | 3176 | 0.9% |
| Other values (2653) | 309346 |
| Value | Count | Frequency (%) |
| 0 | 2325 | |
| 9.818762165 × 10-13 | 1961 | |
| 1.488035366 × 10-12 | 84 | < 0.1% |
| 1.488100836 × 10-12 | 238 | 0.1% |
| 1.488206632 × 10-12 | 96 | < 0.1% |
| 1.488245041 × 10-12 | 421 | 0.1% |
| 1.488391755 × 10-12 | 412 | 0.1% |
| 1.488418421 × 10-12 | 461 | 0.1% |
| 1.488818307 × 10-12 | 3 | < 0.1% |
| 1.488952632 × 10-12 | 2 | < 0.1% |
| Value | Count | Frequency (%) |
| 9.112258173 | 1 | |
| 9.041788085 | 1 | |
| 8.954820805 | 1 | |
| 8.727328843 | 1 | |
| 8.622532883 | 1 | |
| 8.568639995 | 1 | |
| 7.023637674 | 1 | |
| 6.04772788 | 1 | |
| 5.887380573 | 1 | |
| 5.840948824 | 1 |
month
Categorical
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
| August | |
|---|---|
| July | |
| June | |
| October | |
| September | |
| Other values (7) |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 5.9405696 |
| Min length | 3 |
Characters and Unicode
| Total characters | 2097229 |
|---|---|
| Distinct characters | 26 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | January |
|---|---|
| 2nd row | January |
| 3rd row | January |
| 4th row | January |
| 5th row | January |
Common Values
| Value | Count | Frequency (%) |
| August | 44325 | |
| July | 42171 | |
| June | 40820 | |
| October | 39044 | |
| September | 38919 | |
| May | 34352 | |
| November | 24876 | |
| April | 23574 | |
| December | 20182 | |
| March | 17064 | 4.8% |
| Other values (2) | 27708 |
Length
| Value | Count | Frequency (%) |
| august | 44325 | |
| july | 42171 | |
| june | 40820 | |
| october | 39044 | |
| september | 38919 | |
| may | 34352 | |
| november | 24876 | |
| april | 23574 | |
| december | 20182 | |
| march | 17064 | 4.8% |
| Other values (2) | 27708 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 321989 | |
| r | 206437 | 9.8% |
| u | 199349 | 9.5% |
| b | 138091 | 6.6% |
| t | 122288 | 5.8% |
| y | 104231 | 5.0% |
| J | 95629 | 4.6% |
| a | 91762 | 4.4% |
| m | 83977 | 4.0% |
| c | 76290 | 3.6% |
| Other values (16) | 657186 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1744194 | |
| Uppercase Letter | 353035 | 16.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 321989 | |
| r | 206437 | |
| u | 199349 | |
| b | 138091 | |
| t | 122288 | 7.0% |
| y | 104231 | 6.0% |
| a | 91762 | 5.3% |
| m | 83977 | 4.8% |
| c | 76290 | 4.4% |
| l | 65745 | 3.8% |
| Other values (8) | 334035 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 95629 | |
| A | 67899 | |
| M | 51416 | |
| O | 39044 | |
| S | 38919 | |
| N | 24876 | 7.0% |
| D | 20182 | 5.7% |
| F | 15070 | 4.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2097229 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 321989 | |
| r | 206437 | 9.8% |
| u | 199349 | 9.5% |
| b | 138091 | 6.6% |
| t | 122288 | 5.8% |
| y | 104231 | 5.0% |
| J | 95629 | 4.6% |
| a | 91762 | 4.4% |
| m | 83977 | 4.0% |
| c | 76290 | 3.6% |
| Other values (16) | 657186 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2097229 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 321989 | |
| r | 206437 | 9.8% |
| u | 199349 | 9.5% |
| b | 138091 | 6.6% |
| t | 122288 | 5.8% |
| y | 104231 | 5.0% |
| J | 95629 | 4.6% |
| a | 91762 | 4.4% |
| m | 83977 | 4.0% |
| c | 76290 | 3.6% |
| Other values (16) | 657186 |
day
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
| weekday | |
|---|---|
| weekend |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 2471245 |
|---|---|
| Distinct characters | 7 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | weekday |
|---|---|
| 2nd row | weekday |
| 3rd row | weekday |
| 4th row | weekday |
| 5th row | weekday |
Common Values
| Value | Count | Frequency (%) |
| weekday | 278318 | |
| weekend | 74717 | 21.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| weekday | 278318 | |
| weekend | 74717 | 21.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 780787 | |
| w | 353035 | |
| k | 353035 | |
| d | 353035 | |
| a | 278318 | 11.3% |
| y | 278318 | 11.3% |
| n | 74717 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2471245 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 780787 | |
| w | 353035 | |
| k | 353035 | |
| d | 353035 | |
| a | 278318 | 11.3% |
| y | 278318 | 11.3% |
| n | 74717 | 3.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2471245 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 780787 | |
| w | 353035 | |
| k | 353035 | |
| d | 353035 | |
| a | 278318 | 11.3% |
| y | 278318 | 11.3% |
| n | 74717 | 3.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2471245 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 780787 | |
| w | 353035 | |
| k | 353035 | |
| d | 353035 | |
| a | 278318 | 11.3% |
| y | 278318 | 11.3% |
| n | 74717 | 3.0% |
hour
Real number (ℝ)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.633679 |
| Minimum | 0 |
|---|---|
| Maximum | 23 |
| Zeros | 2750 |
| Zeros (%) | 0.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 6 |
| Q1 | 9 |
| median | 14 |
| Q3 | 18 |
| 95-th percentile | 21 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 9 |
Descriptive statistics
| Standard deviation | 5.1565432 |
|---|---|
| Coefficient of variation (CV) | 0.37822095 |
| Kurtosis | -1.0104156 |
| Mean | 13.633679 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.20483033 |
| Sum | 4813166 |
| Variance | 26.589937 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 8 | 42943 | |
| 18 | 37328 | 10.6% |
| 17 | 32759 | 9.3% |
| 19 | 26627 | 7.5% |
| 7 | 24169 | 6.8% |
| 9 | 21232 | 6.0% |
| 20 | 18473 | 5.2% |
| 16 | 18405 | 5.2% |
| 12 | 15123 | 4.3% |
| 15 | 15058 | 4.3% |
| Other values (14) | 100918 |
| Value | Count | Frequency (%) |
| 0 | 2750 | 0.8% |
| 1 | 1387 | 0.4% |
| 2 | 744 | 0.2% |
| 3 | 519 | 0.1% |
| 4 | 786 | 0.2% |
| 5 | 3531 | 1.0% |
| 6 | 10156 | 2.9% |
| 7 | 24169 | |
| 8 | 42943 | |
| 9 | 21232 |
| Value | Count | Frequency (%) |
| 23 | 4671 | 1.3% |
| 22 | 8055 | 2.3% |
| 21 | 12194 | 3.5% |
| 20 | 18473 | |
| 19 | 26627 | |
| 18 | 37328 | |
| 17 | 32759 | |
| 16 | 18405 | |
| 15 | 15058 | |
| 14 | 14224 | 4.0% |
min
Real number (ℝ)
| Distinct | 60 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29.544399 |
| Minimum | 0 |
|---|---|
| Maximum | 59 |
| Zeros | 5390 |
| Zeros (%) | 1.5% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 15 |
| median | 30 |
| Q3 | 44 |
| 95-th percentile | 56 |
| Maximum | 59 |
| Range | 59 |
| Interquartile range (IQR) | 29 |
Descriptive statistics
| Standard deviation | 17.265944 |
|---|---|
| Coefficient of variation (CV) | 0.58440666 |
| Kurtosis | -1.2065094 |
| Mean | 29.544399 |
| Median Absolute Deviation (MAD) | 15 |
| Skewness | -0.0037686619 |
| Sum | 10430207 |
| Variance | 298.11281 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 44 | 6284 | 1.8% |
| 23 | 6182 | 1.8% |
| 24 | 6181 | 1.8% |
| 43 | 6162 | 1.7% |
| 25 | 6146 | 1.7% |
| 53 | 6122 | 1.7% |
| 45 | 6092 | 1.7% |
| 41 | 6072 | 1.7% |
| 11 | 6048 | 1.7% |
| 22 | 6044 | 1.7% |
| Other values (50) | 291702 |
| Value | Count | Frequency (%) |
| 0 | 5390 | |
| 1 | 5655 | |
| 2 | 5712 | |
| 3 | 5913 | |
| 4 | 5929 | |
| 5 | 5893 | |
| 6 | 6007 | |
| 7 | 5724 | |
| 8 | 5932 | |
| 9 | 5938 |
| Value | Count | Frequency (%) |
| 59 | 5559 | |
| 58 | 5558 | |
| 57 | 5811 | |
| 56 | 5944 | |
| 55 | 5801 | |
| 54 | 5860 | |
| 53 | 6122 | |
| 52 | 5880 | |
| 51 | 5967 | |
| 50 | 5990 |
birthyear
Real number (ℝ)
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1980.4391 |
| Minimum | 1939 |
|---|---|
| Maximum | 2002 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 1939 |
|---|---|
| 5-th percentile | 1960 |
| Q1 | 1974 |
| median | 1983 |
| Q3 | 1988 |
| 95-th percentile | 1993 |
| Maximum | 2002 |
| Range | 63 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 10.09409 |
|---|---|
| Coefficient of variation (CV) | 0.005096895 |
| Kurtosis | -0.014329794 |
| Mean | 1980.4391 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.80006489 |
| Sum | 6.9916432 × 108 |
| Variance | 101.89066 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1969 | 22215 | 6.3% |
| 1988 | 21646 | 6.1% |
| 1986 | 19023 | 5.4% |
| 1989 | 17852 | 5.1% |
| 1987 | 16566 | 4.7% |
| 1990 | 15322 | 4.3% |
| 1985 | 14916 | 4.2% |
| 1983 | 14488 | 4.1% |
| 1984 | 14392 | 4.1% |
| 1991 | 13872 | 3.9% |
| Other values (54) | 182743 |
| Value | Count | Frequency (%) |
| 1939 | 2 | < 0.1% |
| 1940 | 3 | < 0.1% |
| 1941 | 132 | |
| 1942 | 9 | < 0.1% |
| 1943 | 2 | < 0.1% |
| 1944 | 73 | |
| 1945 | 4 | < 0.1% |
| 1946 | 15 | < 0.1% |
| 1947 | 41 | < 0.1% |
| 1948 | 12 | < 0.1% |
| Value | Count | Frequency (%) |
| 2002 | 17 | < 0.1% |
| 2001 | 22 | < 0.1% |
| 2000 | 198 | 0.1% |
| 1999 | 162 | < 0.1% |
| 1998 | 519 | 0.1% |
| 1997 | 434 | 0.1% |
| 1996 | 1834 | 0.5% |
| 1995 | 3654 | |
| 1994 | 7831 | |
| 1993 | 8314 |
years_old
Real number (ℝ)
| Distinct | 64 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 37.560888 |
| Minimum | 16 |
|---|---|
| Maximum | 79 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 16 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 30 |
| median | 35 |
| Q3 | 44 |
| 95-th percentile | 58 |
| Maximum | 79 |
| Range | 63 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 10.09409 |
|---|---|
| Coefficient of variation (CV) | 0.26873939 |
| Kurtosis | -0.014329794 |
| Mean | 37.560888 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | 0.80006489 |
| Sum | 13260308 |
| Variance | 101.89066 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 49 | 22215 | 6.3% |
| 30 | 21646 | 6.1% |
| 32 | 19023 | 5.4% |
| 29 | 17852 | 5.1% |
| 31 | 16566 | 4.7% |
| 28 | 15322 | 4.3% |
| 33 | 14916 | 4.2% |
| 35 | 14488 | 4.1% |
| 34 | 14392 | 4.1% |
| 27 | 13872 | 3.9% |
| Other values (54) | 182743 |
| Value | Count | Frequency (%) |
| 16 | 17 | < 0.1% |
| 17 | 22 | < 0.1% |
| 18 | 198 | 0.1% |
| 19 | 162 | < 0.1% |
| 20 | 519 | 0.1% |
| 21 | 434 | 0.1% |
| 22 | 1834 | 0.5% |
| 23 | 3654 | |
| 24 | 7831 | |
| 25 | 8314 |
| Value | Count | Frequency (%) |
| 79 | 2 | < 0.1% |
| 78 | 3 | < 0.1% |
| 77 | 132 | |
| 76 | 9 | < 0.1% |
| 75 | 2 | < 0.1% |
| 74 | 73 | |
| 73 | 4 | < 0.1% |
| 72 | 15 | < 0.1% |
| 71 | 41 | < 0.1% |
| 70 | 12 | < 0.1% |
holiday
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
| working_day | |
|---|---|
| holiday | 7451 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 10.915578 |
| Min length | 7 |
Characters and Unicode
| Total characters | 3853581 |
|---|---|
| Distinct characters | 13 |
| Distinct categories | 2 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | holiday |
|---|---|
| 2nd row | holiday |
| 3rd row | holiday |
| 4th row | holiday |
| 5th row | holiday |
Common Values
| Value | Count | Frequency (%) |
| working_day | 345584 | |
| holiday | 7451 | 2.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| working_day | 345584 | |
| holiday | 7451 | 2.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 353035 | |
| i | 353035 | |
| d | 353035 | |
| a | 353035 | |
| y | 353035 | |
| w | 345584 | |
| r | 345584 | |
| k | 345584 | |
| n | 345584 | |
| g | 345584 | |
| Other values (3) | 360486 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3507997 | |
| Connector Punctuation | 345584 | 9.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 353035 | |
| i | 353035 | |
| d | 353035 | |
| a | 353035 | |
| y | 353035 | |
| w | 345584 | |
| r | 345584 | |
| k | 345584 | |
| n | 345584 | |
| g | 345584 | |
| Other values (2) | 14902 | 0.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 345584 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3507997 | |
| Common | 345584 | 9.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 353035 | |
| i | 353035 | |
| d | 353035 | |
| a | 353035 | |
| y | 353035 | |
| w | 345584 | |
| r | 345584 | |
| k | 345584 | |
| n | 345584 | |
| g | 345584 | |
| Other values (2) | 14902 | 0.4% |
Common
| Value | Count | Frequency (%) |
| _ | 345584 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3853581 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 353035 | |
| i | 353035 | |
| d | 353035 | |
| a | 353035 | |
| y | 353035 | |
| w | 345584 | |
| r | 345584 | |
| k | 345584 | |
| n | 345584 | |
| g | 345584 | |
| Other values (3) | 360486 |
tempmax
Real number (ℝ)
| Distinct | 116 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.393652 |
| Minimum | -11.1 |
|---|---|
| Maximum | 35 |
| Zeros | 311 |
| Zeros (%) | 0.1% |
| Negative | 2581 |
| Negative (%) | 0.7% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | -11.1 |
|---|---|
| 5-th percentile | 3.9 |
| Q1 | 12.9 |
| median | 22.9 |
| Q3 | 27.8 |
| 95-th percentile | 32.2 |
| Maximum | 35 |
| Range | 46.1 |
| Interquartile range (IQR) | 14.9 |
Descriptive statistics
| Standard deviation | 9.1360839 |
|---|---|
| Coefficient of variation (CV) | 0.44798666 |
| Kurtosis | -0.83365281 |
| Mean | 20.393652 |
| Median Absolute Deviation (MAD) | 6.2 |
| Skewness | -0.53979696 |
| Sum | 7199672.9 |
| Variance | 83.46803 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 28.4 | 13699 | 3.9% |
| 26.7 | 13189 | 3.7% |
| 27.2 | 10677 | 3.0% |
| 26.2 | 9536 | 2.7% |
| 25.1 | 8658 | 2.5% |
| 21.7 | 8106 | 2.3% |
| 31.7 | 8078 | 2.3% |
| 22.2 | 7624 | 2.2% |
| 31.2 | 7382 | 2.1% |
| 30.7 | 7273 | 2.1% |
| Other values (106) | 258813 |
| Value | Count | Frequency (%) |
| -11.1 | 75 | < 0.1% |
| -7.8 | 191 | |
| -7.1 | 120 | < 0.1% |
| -4.4 | 191 | |
| -3.3 | 391 | |
| -2.8 | 175 | < 0.1% |
| -1.7 | 52 | < 0.1% |
| -1.6 | 467 | |
| -1.2 | 296 | |
| -1.1 | 301 |
| Value | Count | Frequency (%) |
| 35 | 2296 | 0.7% |
| 33.9 | 1600 | 0.5% |
| 33.4 | 4404 | |
| 33.3 | 1237 | 0.4% |
| 32.9 | 4294 | |
| 32.2 | 4796 | |
| 31.7 | 8078 | |
| 31.2 | 7382 | |
| 31.1 | 1425 | 0.4% |
| 30.7 | 7273 |
| Distinct | 112 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 13.408749 |
| Minimum | -15 |
|---|---|
| Maximum | 27.2 |
| Zeros | 4161 |
| Zeros (%) | 1.2% |
| Negative | 25652 |
| Negative (%) | 7.3% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | -15 |
|---|---|
| 5-th percentile | -1.7 |
| Q1 | 6.1 |
| median | 15.7 |
| Q3 | 20.7 |
| 95-th percentile | 24.4 |
| Maximum | 27.2 |
| Range | 42.2 |
| Interquartile range (IQR) | 14.6 |
Descriptive statistics
| Standard deviation | 8.7414052 |
|---|---|
| Coefficient of variation (CV) | 0.65191801 |
| Kurtosis | -0.83557998 |
| Mean | 13.408749 |
| Median Absolute Deviation (MAD) | 6.5 |
| Skewness | -0.51587694 |
| Sum | 4733757.8 |
| Variance | 76.412165 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20.7 | 17943 | 5.1% |
| 21.7 | 14211 | 4.0% |
| 19.4 | 12439 | 3.5% |
| 22.9 | 11145 | 3.2% |
| 20.1 | 9991 | 2.8% |
| 18.9 | 9435 | 2.7% |
| 24.4 | 9227 | 2.6% |
| 23.4 | 9155 | 2.6% |
| 17.2 | 8671 | 2.5% |
| 22.2 | 8270 | 2.3% |
| Other values (102) | 242548 |
| Value | Count | Frequency (%) |
| -15 | 90 | < 0.1% |
| -13.8 | 176 | < 0.1% |
| -12.8 | 120 | < 0.1% |
| -10.9 | 391 | |
| -9.3 | 502 | |
| -8.8 | 597 | |
| -8.4 | 467 | |
| -8.3 | 816 | |
| -7.8 | 699 | |
| -7.1 | 469 |
| Value | Count | Frequency (%) |
| 27.2 | 1593 | 0.5% |
| 26.6 | 1442 | 0.4% |
| 26.2 | 3167 | 0.9% |
| 26.1 | 854 | 0.2% |
| 25.7 | 2895 | 0.8% |
| 25.1 | 3034 | 0.9% |
| 25 | 2522 | 0.7% |
| 24.4 | 9227 | |
| 23.9 | 1631 | 0.5% |
| 23.4 | 9155 |
temp
Real number (ℝ)
| Distinct | 234 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.765239 |
| Minimum | -12.4 |
|---|---|
| Maximum | 30.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 9225 |
| Negative (%) | 2.6% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | -12.4 |
|---|---|
| 5-th percentile | 1.5 |
| Q1 | 9.1 |
| median | 19.5 |
| Q3 | 23.5 |
| 95-th percentile | 27.8 |
| Maximum | 30.5 |
| Range | 42.9 |
| Interquartile range (IQR) | 14.4 |
Descriptive statistics
| Standard deviation | 8.8048127 |
|---|---|
| Coefficient of variation (CV) | 0.52518267 |
| Kurtosis | -0.84871713 |
| Mean | 16.765239 |
| Median Absolute Deviation (MAD) | 6 |
| Skewness | -0.53961047 |
| Sum | 5918716 |
| Variance | 77.524727 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23.4 | 9434 | 2.7% |
| 17.1 | 6581 | 1.9% |
| 23.2 | 5977 | 1.7% |
| 20.8 | 5661 | 1.6% |
| 27.7 | 4674 | 1.3% |
| 27.8 | 4551 | 1.3% |
| 23.3 | 4549 | 1.3% |
| 23 | 4465 | 1.3% |
| 23.1 | 4459 | 1.3% |
| 24.6 | 4351 | 1.2% |
| Other values (224) | 298333 |
| Value | Count | Frequency (%) |
| -12.4 | 75 | < 0.1% |
| -11.4 | 90 | < 0.1% |
| -10.7 | 101 | < 0.1% |
| -10.4 | 120 | < 0.1% |
| -7.5 | 391 | |
| -7.4 | 191 | 0.1% |
| -5.2 | 296 | |
| -5 | 642 | |
| -4.6 | 301 | |
| -4.3 | 311 |
| Value | Count | Frequency (%) |
| 30.5 | 854 | 0.2% |
| 30.3 | 1593 | 0.5% |
| 30 | 3042 | |
| 29.1 | 1658 | 0.5% |
| 28.8 | 1414 | 0.4% |
| 28.7 | 1567 | 0.4% |
| 28.6 | 1043 | 0.3% |
| 27.9 | 2864 | |
| 27.8 | 4551 | |
| 27.7 | 4674 |
feelslike
Real number (ℝ)
| Distinct | 227 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 16.189142 |
| Minimum | -20.1 |
|---|---|
| Maximum | 33.5 |
| Zeros | 896 |
| Zeros (%) | 0.3% |
| Negative | 34439 |
| Negative (%) | 9.8% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | -20.1 |
|---|---|
| 5-th percentile | -2.3 |
| Q1 | 7.4 |
| median | 19.5 |
| Q3 | 23.8 |
| 95-th percentile | 29.6 |
| Maximum | 33.5 |
| Range | 53.6 |
| Interquartile range (IQR) | 16.4 |
Descriptive statistics
| Standard deviation | 10.310905 |
|---|---|
| Coefficient of variation (CV) | 0.6369025 |
| Kurtosis | -0.65784668 |
| Mean | 16.189142 |
| Median Absolute Deviation (MAD) | 6.6 |
| Skewness | -0.6041219 |
| Sum | 5715333.6 |
| Variance | 106.31476 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23 | 7174 | 2.0% |
| 23.3 | 7011 | 2.0% |
| 23.5 | 6928 | 2.0% |
| 17.1 | 6581 | 1.9% |
| 22.2 | 5352 | 1.5% |
| 24.1 | 4831 | 1.4% |
| 20.8 | 4638 | 1.3% |
| 24.5 | 4532 | 1.3% |
| 23.1 | 4459 | 1.3% |
| 26.7 | 4372 | 1.2% |
| Other values (217) | 297157 |
| Value | Count | Frequency (%) |
| -20.1 | 75 | < 0.1% |
| -18.7 | 120 | < 0.1% |
| -17.1 | 90 | < 0.1% |
| -17 | 101 | < 0.1% |
| -13 | 582 | |
| -12.7 | 175 | < 0.1% |
| -10.4 | 296 | |
| -10.3 | 52 | < 0.1% |
| -9.6 | 301 | |
| -8.7 | 311 |
| Value | Count | Frequency (%) |
| 33.5 | 1593 | |
| 33 | 1600 | |
| 32.7 | 854 | |
| 32.4 | 1442 | |
| 31.3 | 1658 | |
| 30.8 | 1414 | |
| 30.6 | 1567 | |
| 30.3 | 1237 | |
| 30.1 | 1495 | |
| 30 | 1025 |
| Distinct | 166 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.5119011 |
| Minimum | 0 |
|---|---|
| Maximum | 72.817 |
| Zeros | 200641 |
| Zeros (%) | 56.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 2.083 |
| 95-th percentile | 20.058 |
| Maximum | 72.817 |
| Range | 72.817 |
| Interquartile range (IQR) | 2.083 |
Descriptive statistics
| Standard deviation | 8.628572 |
|---|---|
| Coefficient of variation (CV) | 2.4569519 |
| Kurtosis | 20.813467 |
| Mean | 3.5119011 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 4.0030415 |
| Sum | 1239824 |
| Variance | 74.452254 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 200641 | |
| 0.002 | 4579 | 1.3% |
| 0.295 | 1846 | 0.5% |
| 16.289 | 1715 | 0.5% |
| 0.297 | 1699 | 0.5% |
| 1.174 | 1697 | 0.5% |
| 15.43 | 1682 | 0.5% |
| 0.014 | 1682 | 0.5% |
| 1.095 | 1671 | 0.5% |
| 20.842 | 1667 | 0.5% |
| Other values (156) | 134156 |
| Value | Count | Frequency (%) |
| 0 | 200641 | |
| 0.002 | 4579 | 1.3% |
| 0.004 | 807 | 0.2% |
| 0.005 | 1626 | 0.5% |
| 0.007 | 1620 | 0.5% |
| 0.014 | 1682 | 0.5% |
| 0.016 | 379 | 0.1% |
| 0.018 | 1553 | 0.4% |
| 0.038 | 1090 | 0.3% |
| 0.054 | 726 | 0.2% |
| Value | Count | Frequency (%) |
| 72.817 | 467 | 0.1% |
| 71.84 | 732 | |
| 56.579 | 1215 | |
| 49.725 | 130 | < 0.1% |
| 49.419 | 532 | |
| 41.371 | 457 | 0.1% |
| 38.722 | 901 | |
| 35.679 | 147 | < 0.1% |
| 35.163 | 189 | 0.1% |
| 34.533 | 1152 |
dew
Real number (ℝ)
| Distinct | 237 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.8289507 |
| Minimum | -23.1 |
|---|---|
| Maximum | 23.7 |
| Zeros | 707 |
| Zeros (%) | 0.2% |
| Negative | 77284 |
| Negative (%) | 21.9% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | -23.1 |
|---|---|
| 5-th percentile | -8 |
| Q1 | 1.7 |
| median | 12.1 |
| Q3 | 18.2 |
| 95-th percentile | 22.2 |
| Maximum | 23.7 |
| Range | 46.8 |
| Interquartile range (IQR) | 16.5 |
Descriptive statistics
| Standard deviation | 10.027654 |
|---|---|
| Coefficient of variation (CV) | 1.0202161 |
| Kurtosis | -0.74225288 |
| Mean | 9.8289507 |
| Median Absolute Deviation (MAD) | 7.4 |
| Skewness | -0.58868835 |
| Sum | 3469963.6 |
| Variance | 100.55384 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 20 | 6000 | 1.7% |
| 13.3 | 5682 | 1.6% |
| 21.7 | 5199 | 1.5% |
| 17.1 | 5187 | 1.5% |
| 22.8 | 5138 | 1.5% |
| 22.2 | 4699 | 1.3% |
| 11.7 | 4517 | 1.3% |
| 21.2 | 4348 | 1.2% |
| -0.4 | 4135 | 1.2% |
| 19.5 | 4074 | 1.2% |
| Other values (227) | 304056 |
| Value | Count | Frequency (%) |
| -23.1 | 75 | < 0.1% |
| -20.7 | 120 | < 0.1% |
| -20.4 | 90 | < 0.1% |
| -19.9 | 101 | < 0.1% |
| -18 | 191 | 0.1% |
| -17.6 | 175 | < 0.1% |
| -16.1 | 778 | |
| -15.7 | 391 | |
| -15.3 | 883 | |
| -14.9 | 381 |
| Value | Count | Frequency (%) |
| 23.7 | 1237 | 0.4% |
| 23 | 1212 | 0.3% |
| 22.9 | 3092 | |
| 22.8 | 5138 | |
| 22.6 | 1179 | 0.3% |
| 22.5 | 1512 | 0.4% |
| 22.4 | 1343 | 0.4% |
| 22.2 | 4699 | |
| 22.1 | 2815 | |
| 22 | 1308 | 0.4% |
humidity
Real number (ℝ)
| Distinct | 276 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 66.231963 |
| Minimum | 22.6 |
|---|---|
| Maximum | 96 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 22.6 |
|---|---|
| 5-th percentile | 41.2 |
| Q1 | 56 |
| median | 66.8 |
| Q3 | 77.6 |
| 95-th percentile | 89.2 |
| Maximum | 96 |
| Range | 73.4 |
| Interquartile range (IQR) | 21.6 |
Descriptive statistics
| Standard deviation | 14.814256 |
|---|---|
| Coefficient of variation (CV) | 0.22367231 |
| Kurtosis | -0.57956187 |
| Mean | 66.231963 |
| Median Absolute Deviation (MAD) | 10.8 |
| Skewness | -0.19742869 |
| Sum | 23382201 |
| Variance | 219.46218 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 54.7 | 4472 | 1.3% |
| 59.2 | 4432 | 1.3% |
| 79.1 | 3988 | 1.1% |
| 56.6 | 3755 | 1.1% |
| 75.5 | 3625 | 1.0% |
| 64.1 | 3593 | 1.0% |
| 66.8 | 3544 | 1.0% |
| 50.3 | 3435 | 1.0% |
| 62.8 | 3353 | 0.9% |
| 77.3 | 3233 | 0.9% |
| Other values (266) | 315605 |
| Value | Count | Frequency (%) |
| 22.6 | 1274 | |
| 25.8 | 983 | |
| 28.7 | 381 | 0.1% |
| 33 | 738 | |
| 34 | 771 | |
| 34.2 | 1219 | |
| 35.1 | 630 | |
| 36.1 | 966 | |
| 36.3 | 1109 | |
| 36.9 | 437 | 0.1% |
| Value | Count | Frequency (%) |
| 96 | 151 | < 0.1% |
| 95.8 | 1023 | 0.3% |
| 95.1 | 375 | 0.1% |
| 95 | 301 | 0.1% |
| 94.4 | 1397 | |
| 94.1 | 457 | 0.1% |
| 92.7 | 2720 | |
| 92.2 | 974 | 0.3% |
| 91.7 | 1224 | |
| 91.5 | 2520 |
| Distinct | 16 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.075514609 |
| Minimum | 0 |
|---|---|
| Maximum | 12.5 |
| Zeros | 345872 |
| Zeros (%) | 98.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 12.5 |
| Range | 12.5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.62919415 |
|---|---|
| Coefficient of variation (CV) | 8.3320851 |
| Kurtosis | 126.86002 |
| Mean | 0.075514609 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 10.452459 |
| Sum | 26659.3 |
| Variance | 0.39588528 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 345872 | |
| 1.9 | 1659 | 0.5% |
| 3 | 650 | 0.2% |
| 3.7 | 628 | 0.2% |
| 1.5 | 562 | 0.2% |
| 6.1 | 541 | 0.2% |
| 8.1 | 492 | 0.1% |
| 1.2 | 485 | 0.1% |
| 5 | 476 | 0.1% |
| 1 | 469 | 0.1% |
| Other values (6) | 1201 | 0.3% |
| Value | Count | Frequency (%) |
| 0 | 345872 | |
| 1 | 469 | 0.1% |
| 1.2 | 485 | 0.1% |
| 1.5 | 562 | 0.2% |
| 1.9 | 1659 | 0.5% |
| 3 | 650 | 0.2% |
| 3.7 | 628 | 0.2% |
| 4.9 | 396 | 0.1% |
| 5 | 476 | 0.1% |
| 5.7 | 120 | < 0.1% |
| Value | Count | Frequency (%) |
| 12.5 | 91 | < 0.1% |
| 10.6 | 52 | < 0.1% |
| 8.1 | 492 | |
| 7.5 | 128 | < 0.1% |
| 6.1 | 541 | |
| 5.8 | 414 | |
| 5.7 | 120 | < 0.1% |
| 5 | 476 | |
| 4.9 | 396 | |
| 3.7 | 628 |
| Distinct | 29 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.15625306 |
| Minimum | 0 |
|---|---|
| Maximum | 16.4 |
| Zeros | 336604 |
| Zeros (%) | 95.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 16.4 |
| Range | 16.4 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 1.1091937 |
|---|---|
| Coefficient of variation (CV) | 7.0987004 |
| Kurtosis | 96.394366 |
| Mean | 0.15625306 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 9.2708642 |
| Sum | 55162.8 |
| Variance | 1.2303106 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 336604 | |
| 0.1 | 2570 | 0.7% |
| 0.6 | 1627 | 0.5% |
| 1.8 | 1047 | 0.3% |
| 0.8 | 1041 | 0.3% |
| 0.5 | 762 | 0.2% |
| 3.4 | 691 | 0.2% |
| 4.3 | 632 | 0.2% |
| 8.3 | 628 | 0.2% |
| 0.2 | 616 | 0.2% |
| Other values (19) | 6817 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 336604 | |
| 0.1 | 2570 | 0.7% |
| 0.2 | 616 | 0.2% |
| 0.3 | 560 | 0.2% |
| 0.5 | 762 | 0.2% |
| 0.6 | 1627 | 0.5% |
| 0.8 | 1041 | 0.3% |
| 1.2 | 384 | 0.1% |
| 1.7 | 476 | 0.1% |
| 1.8 | 1047 | 0.3% |
| Value | Count | Frequency (%) |
| 16.4 | 120 | < 0.1% |
| 15.2 | 203 | 0.1% |
| 14.6 | 90 | < 0.1% |
| 13.1 | 322 | |
| 12.3 | 414 | |
| 9.7 | 509 | |
| 9.5 | 396 | |
| 8.3 | 628 | |
| 6.8 | 304 | |
| 6.4 | 579 |
windspeed
Real number (ℝ)
| Distinct | 162 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 20.475578 |
| Minimum | 5.6 |
|---|---|
| Maximum | 49.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 5.6 |
|---|---|
| 5-th percentile | 9.4 |
| Q1 | 14.7 |
| median | 19.9 |
| Q3 | 25.3 |
| 95-th percentile | 36 |
| Maximum | 49.5 |
| Range | 43.9 |
| Interquartile range (IQR) | 10.6 |
Descriptive statistics
| Standard deviation | 8.181812 |
|---|---|
| Coefficient of variation (CV) | 0.39958882 |
| Kurtosis | 0.31816553 |
| Mean | 20.475578 |
| Median Absolute Deviation (MAD) | 5.2 |
| Skewness | 0.66677106 |
| Sum | 7228595.7 |
| Variance | 66.942047 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 21.3 | 10061 | 2.8% |
| 22.6 | 9385 | 2.7% |
| 17.9 | 8511 | 2.4% |
| 18.4 | 8475 | 2.4% |
| 9.4 | 7950 | 2.3% |
| 7.8 | 6524 | 1.8% |
| 13 | 6518 | 1.8% |
| 17.4 | 6358 | 1.8% |
| 15.6 | 6132 | 1.7% |
| 21.9 | 6008 | 1.7% |
| Other values (152) | 277113 |
| Value | Count | Frequency (%) |
| 5.6 | 1238 | 0.4% |
| 5.7 | 1751 | 0.5% |
| 7.6 | 2529 | 0.7% |
| 7.7 | 2259 | 0.6% |
| 7.8 | 6524 | |
| 7.9 | 1610 | 0.5% |
| 9.3 | 1414 | 0.4% |
| 9.4 | 7950 | |
| 9.5 | 4362 | |
| 9.6 | 3092 | 0.9% |
| Value | Count | Frequency (%) |
| 49.5 | 52 | < 0.1% |
| 48.9 | 812 | |
| 47 | 205 | 0.1% |
| 45.4 | 477 | 0.1% |
| 44.3 | 494 | 0.1% |
| 44 | 722 | |
| 43.5 | 607 | 0.2% |
| 43.1 | 1070 | |
| 42.5 | 1048 | |
| 42.1 | 1706 |
visibility
Real number (ℝ)
| Distinct | 75 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.495764 |
| Minimum | 4.2 |
|---|---|
| Maximum | 16 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 4.2 |
|---|---|
| 5-th percentile | 9.7 |
| Q1 | 13.6 |
| median | 15.6 |
| Q3 | 16 |
| 95-th percentile | 16 |
| Maximum | 16 |
| Range | 11.8 |
| Interquartile range (IQR) | 2.4 |
Descriptive statistics
| Standard deviation | 2.1149088 |
|---|---|
| Coefficient of variation (CV) | 0.14589841 |
| Kurtosis | 2.3367026 |
| Mean | 14.495764 |
| Median Absolute Deviation (MAD) | 0.4 |
| Skewness | -1.6745384 |
| Sum | 5117511.9 |
| Variance | 4.4728394 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 16 | 136676 | |
| 15.8 | 12285 | 3.5% |
| 15.9 | 10999 | 3.1% |
| 15.6 | 10782 | 3.1% |
| 15.5 | 7688 | 2.2% |
| 13.5 | 7626 | 2.2% |
| 14.6 | 7623 | 2.2% |
| 15.3 | 7475 | 2.1% |
| 14.4 | 7150 | 2.0% |
| 15.4 | 6644 | 1.9% |
| Other values (65) | 138087 |
| Value | Count | Frequency (%) |
| 4.2 | 301 | 0.1% |
| 5.6 | 91 | < 0.1% |
| 6.4 | 151 | < 0.1% |
| 7.1 | 2005 | |
| 7.2 | 370 | 0.1% |
| 7.3 | 147 | < 0.1% |
| 7.5 | 662 | 0.2% |
| 7.6 | 2206 | |
| 7.7 | 250 | 0.1% |
| 7.8 | 940 |
| Value | Count | Frequency (%) |
| 16 | 136676 | |
| 15.9 | 10999 | 3.1% |
| 15.8 | 12285 | 3.5% |
| 15.7 | 6390 | 1.8% |
| 15.6 | 10782 | 3.1% |
| 15.5 | 7688 | 2.2% |
| 15.4 | 6644 | 1.9% |
| 15.3 | 7475 | 2.1% |
| 15.2 | 2144 | 0.6% |
| 15.1 | 4850 | 1.4% |
solarradiation
Real number (ℝ)
| Distinct | 345 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 167.47613 |
| Minimum | 11 |
|---|---|
| Maximum | 331.4 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 11 |
|---|---|
| 5-th percentile | 33.2 |
| Q1 | 84.1 |
| median | 162.5 |
| Q3 | 242.8 |
| 95-th percentile | 315.6 |
| Maximum | 331.4 |
| Range | 320.4 |
| Interquartile range (IQR) | 158.7 |
Descriptive statistics
| Standard deviation | 90.765205 |
|---|---|
| Coefficient of variation (CV) | 0.54195904 |
| Kurtosis | -1.2138182 |
| Mean | 167.47613 |
| Median Absolute Deviation (MAD) | 80.1 |
| Skewness | 0.1233206 |
| Sum | 59124937 |
| Variance | 8238.3224 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 208.7 | 3060 | 0.9% |
| 242.6 | 2984 | 0.8% |
| 268.7 | 2664 | 0.8% |
| 273.5 | 2577 | 0.7% |
| 52.5 | 2455 | 0.7% |
| 77.5 | 2257 | 0.6% |
| 102.7 | 1946 | 0.6% |
| 81.5 | 1820 | 0.5% |
| 315.6 | 1788 | 0.5% |
| 232.4 | 1753 | 0.5% |
| Other values (335) | 329731 |
| Value | Count | Frequency (%) |
| 11 | 375 | 0.1% |
| 11.1 | 52 | < 0.1% |
| 12.4 | 250 | 0.1% |
| 12.5 | 189 | 0.1% |
| 13.6 | 313 | 0.1% |
| 14.1 | 1023 | |
| 14.5 | 130 | < 0.1% |
| 15.2 | 457 | |
| 17.1 | 147 | < 0.1% |
| 19 | 872 |
| Value | Count | Frequency (%) |
| 331.4 | 1113 | |
| 327.5 | 1147 | |
| 327.2 | 1616 | |
| 326 | 1598 | |
| 325.2 | 1150 | |
| 323.9 | 1477 | |
| 321.9 | 1549 | |
| 319.2 | 1653 | |
| 317.6 | 1043 | |
| 317.4 | 1219 |
cloudcover
Real number (ℝ)
| Distinct | 276 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 45.742566 |
| Minimum | 0.1 |
|---|---|
| Maximum | 100 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.7 MiB |
Quantile statistics
| Minimum | 0.1 |
|---|---|
| 5-th percentile | 0.6 |
| Q1 | 17.3 |
| median | 43.7 |
| Q3 | 74 |
| 95-th percentile | 99.5 |
| Maximum | 100 |
| Range | 99.9 |
| Interquartile range (IQR) | 56.7 |
Descriptive statistics
| Standard deviation | 32.567541 |
|---|---|
| Coefficient of variation (CV) | 0.71197452 |
| Kurtosis | -1.2347865 |
| Mean | 45.742566 |
| Median Absolute Deviation (MAD) | 28.1 |
| Skewness | 0.19209673 |
| Sum | 16148727 |
| Variance | 1060.6447 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 100 | 14230 | 4.0% |
| 0.7 | 5799 | 1.6% |
| 0.5 | 5015 | 1.4% |
| 0.4 | 4420 | 1.3% |
| 0.9 | 4320 | 1.2% |
| 66.8 | 4093 | 1.2% |
| 99.5 | 4077 | 1.2% |
| 65.4 | 4055 | 1.1% |
| 0.6 | 3805 | 1.1% |
| 66.4 | 3142 | 0.9% |
| Other values (266) | 300079 |
| Value | Count | Frequency (%) |
| 0.1 | 381 | 0.1% |
| 0.2 | 2511 | |
| 0.3 | 2156 | 0.6% |
| 0.4 | 4420 | |
| 0.5 | 5015 | |
| 0.6 | 3805 | |
| 0.7 | 5799 | |
| 0.9 | 4320 | |
| 1 | 1934 | 0.5% |
| 1.1 | 2776 |
| Value | Count | Frequency (%) |
| 100 | 14230 | |
| 99.9 | 1012 | 0.3% |
| 99.8 | 1212 | 0.3% |
| 99.6 | 250 | 0.1% |
| 99.5 | 4077 | 1.2% |
| 99.1 | 1928 | 0.5% |
| 98.7 | 379 | 0.1% |
| 98.1 | 1631 | 0.5% |
| 97.9 | 1231 | 0.3% |
| 97.8 | 1224 | 0.3% |
conditions
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
| cloudy_rain | |
|---|---|
| Clear | |
| snow_rain | 6735 |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 9.4058748 |
| Min length | 5 |
Characters and Unicode
| Total characters | 3320603 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Clear |
|---|---|
| 2nd row | Clear |
| 3rd row | Clear |
| 4th row | Clear |
| 5th row | Clear |
Common Values
| Value | Count | Frequency (%) |
| cloudy_rain | 254748 | |
| Clear | 91552 | 25.9% |
| snow_rain | 6735 | 1.9% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| cloudy_rain | 254748 | |
| clear | 91552 | 25.9% |
| snow_rain | 6735 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 353035 | |
| a | 353035 | |
| l | 346300 | |
| n | 268218 | |
| o | 261483 | |
| _ | 261483 | |
| i | 261483 | |
| c | 254748 | |
| u | 254748 | |
| d | 254748 | |
| Other values (5) | 451322 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2967568 | |
| Connector Punctuation | 261483 | 7.9% |
| Uppercase Letter | 91552 | 2.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 353035 | |
| a | 353035 | |
| l | 346300 | |
| n | 268218 | |
| o | 261483 | |
| i | 261483 | |
| c | 254748 | |
| u | 254748 | |
| d | 254748 | |
| y | 254748 | |
| Other values (3) | 105022 | 3.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 261483 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 91552 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3059120 | |
| Common | 261483 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 353035 | |
| a | 353035 | |
| l | 346300 | |
| n | 268218 | |
| o | 261483 | |
| i | 261483 | |
| c | 254748 | |
| u | 254748 | |
| d | 254748 | |
| y | 254748 | |
| Other values (4) | 196574 |
Common
| Value | Count | Frequency (%) |
| _ | 261483 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3320603 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 353035 | |
| a | 353035 | |
| l | 346300 | |
| n | 268218 | |
| o | 261483 | |
| _ | 261483 | |
| i | 261483 | |
| c | 254748 | |
| u | 254748 | |
| d | 254748 | |
| Other values (5) | 451322 |
description
Categorical
| Distinct | 44 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
| Clear conditions throughout the day. | |
|---|---|
| Partly cloudy throughout the day. | |
| Becoming cloudy in the afternoon. | |
| Partly cloudy throughout the day with late afternoon rain. | |
| Partly cloudy throughout the day with rain. | |
| Other values (39) |
Length
| Max length | 82 |
|---|---|
| Median length | 81 |
| Mean length | 43.951506 |
| Min length | 26 |
Characters and Unicode
| Total characters | 15516420 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Clear conditions throughout the day. |
|---|---|
| 2nd row | Clear conditions throughout the day. |
| 3rd row | Clear conditions throughout the day. |
| 4th row | Clear conditions throughout the day. |
| 5th row | Clear conditions throughout the day. |
Common Values
| Value | Count | Frequency (%) |
| Clear conditions throughout the day. | 91552 | |
| Partly cloudy throughout the day. | 65086 | |
| Becoming cloudy in the afternoon. | 22155 | 6.3% |
| Partly cloudy throughout the day with late afternoon rain. | 18353 | 5.2% |
| Partly cloudy throughout the day with rain. | 16268 | 4.6% |
| Partly cloudy throughout the day with rain clearing later. | 13963 | 4.0% |
| Cloudy skies throughout the day. | 12706 | 3.6% |
| Cloudy skies throughout the day with a chance of rain throughout the day. | 11247 | 3.2% |
| Partly cloudy throughout the day with a chance of rain throughout the day. | 9267 | 2.6% |
| Clearing in the afternoon. | 9142 | 2.6% |
| Other values (34) | 83296 |
Length
| Value | Count | Frequency (%) |
| the | 382040 | |
| throughout | 316485 | |
| day | 316485 | |
| cloudy | 235460 | |
| with | 152394 | 6.3% |
| rain | 152147 | 6.3% |
| partly | 149674 | 6.2% |
| afternoon | 108961 | 4.5% |
| conditions | 97515 | 4.1% |
| clear | 97515 | 4.1% |
| Other values (14) | 396686 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2052327 | ||
| t | 1578125 | 10.2% |
| o | 1397362 | 9.0% |
| h | 1190906 | 7.7% |
| a | 994139 | 6.4% |
| r | 949782 | 6.1% |
| u | 868430 | 5.6% |
| n | 820818 | 5.3% |
| e | 814654 | 5.3% |
| i | 726643 | 4.7% |
| Other values (14) | 4123234 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12758023 | |
| Space Separator | 2052327 | 13.2% |
| Other Punctuation | 353035 | 2.3% |
| Uppercase Letter | 353035 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1578125 | |
| o | 1397362 | |
| h | 1190906 | |
| a | 994139 | 7.8% |
| r | 949782 | 7.4% |
| u | 868430 | 6.8% |
| n | 820818 | 6.4% |
| e | 814654 | 6.4% |
| i | 726643 | 5.7% |
| y | 726306 | 5.7% |
| Other values (9) | 2690858 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 163369 | |
| P | 149674 | |
| B | 39992 | 11.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2052327 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 353035 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13111058 | |
| Common | 2405362 | 15.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1578125 | |
| o | 1397362 | |
| h | 1190906 | 9.1% |
| a | 994139 | 7.6% |
| r | 949782 | 7.2% |
| u | 868430 | 6.6% |
| n | 820818 | 6.3% |
| e | 814654 | 6.2% |
| i | 726643 | 5.5% |
| y | 726306 | 5.5% |
| Other values (12) | 3043893 |
Common
| Value | Count | Frequency (%) |
| 2052327 | ||
| . | 353035 | 14.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15516420 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2052327 | ||
| t | 1578125 | 10.2% |
| o | 1397362 | 9.0% |
| h | 1190906 | 7.7% |
| a | 994139 | 6.4% |
| r | 949782 | 6.1% |
| u | 868430 | 5.6% |
| n | 820818 | 5.3% |
| e | 814654 | 5.3% |
| i | 726643 | 4.7% |
| Other values (14) | 4123234 |
seasons
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.7 MiB |
| summer | |
|---|---|
| autumn | |
| spring | |
| winter |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Characters and Unicode
| Total characters | 2118210 |
|---|---|
| Distinct characters | 12 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | winter |
|---|---|
| 2nd row | winter |
| 3rd row | winter |
| 4th row | winter |
| 5th row | winter |
Common Values
| Value | Count | Frequency (%) |
| summer | 127316 | |
| autumn | 102839 | |
| spring | 74990 | |
| winter | 47890 | 13.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| summer | 127316 | |
| autumn | 102839 | |
| spring | 74990 | |
| winter | 47890 | 13.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| m | 357471 | |
| u | 332994 | |
| r | 250196 | |
| n | 225719 | |
| s | 202306 | |
| e | 175206 | |
| t | 150729 | |
| i | 122880 | 5.8% |
| a | 102839 | 4.9% |
| p | 74990 | 3.5% |
| Other values (2) | 122880 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2118210 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 357471 | |
| u | 332994 | |
| r | 250196 | |
| n | 225719 | |
| s | 202306 | |
| e | 175206 | |
| t | 150729 | |
| i | 122880 | 5.8% |
| a | 102839 | 4.9% |
| p | 74990 | 3.5% |
| Other values (2) | 122880 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2118210 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| m | 357471 | |
| u | 332994 | |
| r | 250196 | |
| n | 225719 | |
| s | 202306 | |
| e | 175206 | |
| t | 150729 | |
| i | 122880 | 5.8% |
| a | 102839 | 4.9% |
| p | 74990 | 3.5% |
| Other values (2) | 122880 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2118210 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| m | 357471 | |
| u | 332994 | |
| r | 250196 | |
| n | 225719 | |
| s | 202306 | |
| e | 175206 | |
| t | 150729 | |
| i | 122880 | 5.8% |
| a | 102839 | 4.9% |
| p | 74990 | 3.5% |
| Other values (2) | 122880 | 5.8% |
Auto
The auto setting is an interpretable pairwise column metric of the following mapping:- Variable_type-Variable_type : Method, Range
- Categorical-Categorical : Cramer's V, [0,1]
- Numerical-Categorical : Cramer's V, [0,1] (using a discretized numerical column)
- Numerical-Numerical : Spearman's ρ, [-1,1]
This configuration uses the recommended metric for each pair of columns.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.| tripduration | start_station_id | start_lat | start_lon | end_station_id | end_lat | end_lon | bikeid | usertype | gender | dist | month | day | hour | min | birthyear | years_old | holiday | tempmax | tempmin | temp | feelslike | precip | dew | humidity | snow | snowdepth | windspeed | visibility | solarradiation | cloudcover | conditions | description | seasons | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 932 | 3183 | 40.716247 | -74.033459 | 3199 | 40.728745 | -74.032108 | 31929 | Subscriber | male | 1.084267 | January | weekday | 2 | 6 | 1992 | 26 | holiday | -7.8 | -13.8 | -10.7 | -17.0 | 0.0 | -19.9 | 47.8 | 0.0 | 0.1 | 18.5 | 16.0 | 106.7 | 0.3 | Clear | Clear conditions throughout the day. | winter |
| 1 | 550 | 3183 | 40.716247 | -74.033459 | 3199 | 40.728745 | -74.032108 | 31845 | Subscriber | female | 1.084267 | January | weekday | 12 | 6 | 1969 | 49 | holiday | -7.8 | -13.8 | -10.7 | -17.0 | 0.0 | -19.9 | 47.8 | 0.0 | 0.1 | 18.5 | 16.0 | 106.7 | 0.3 | Clear | Clear conditions throughout the day. | winter |
| 2 | 510 | 3183 | 40.716247 | -74.033459 | 3199 | 40.728745 | -74.032108 | 31708 | Subscriber | male | 1.084267 | January | weekday | 12 | 6 | 1946 | 72 | holiday | -7.8 | -13.8 | -10.7 | -17.0 | 0.0 | -19.9 | 47.8 | 0.0 | 0.1 | 18.5 | 16.0 | 106.7 | 0.3 | Clear | Clear conditions throughout the day. | winter |
| 3 | 354 | 3183 | 40.716247 | -74.033459 | 3267 | 40.712419 | -74.038526 | 31697 | Subscriber | male | 0.415696 | January | weekday | 14 | 53 | 1994 | 24 | holiday | -7.8 | -13.8 | -10.7 | -17.0 | 0.0 | -19.9 | 47.8 | 0.0 | 0.1 | 18.5 | 16.0 | 106.7 | 0.3 | Clear | Clear conditions throughout the day. | winter |
| 4 | 250 | 3183 | 40.716247 | -74.033459 | 3639 | 40.719252 | -74.034234 | 31861 | Subscriber | male | 0.240932 | January | weekday | 17 | 34 | 1991 | 27 | holiday | -7.8 | -13.8 | -10.7 | -17.0 | 0.0 | -19.9 | 47.8 | 0.0 | 0.1 | 18.5 | 16.0 | 106.7 | 0.3 | Clear | Clear conditions throughout the day. | winter |
| 5 | 613 | 3183 | 40.716247 | -74.033459 | 3203 | 40.727596 | -74.044247 | 31859 | Subscriber | male | 1.217917 | January | weekday | 22 | 5 | 1982 | 36 | holiday | -7.8 | -13.8 | -10.7 | -17.0 | 0.0 | -19.9 | 47.8 | 0.0 | 0.1 | 18.5 | 16.0 | 106.7 | 0.3 | Clear | Clear conditions throughout the day. | winter |
| 6 | 290 | 3183 | 40.716247 | -74.033459 | 3267 | 40.712419 | -74.038526 | 31694 | Subscriber | male | 0.415696 | January | weekday | 12 | 13 | 1958 | 60 | working_day | -3.3 | -10.9 | -7.5 | -13.0 | 0.0 | -15.7 | 52.2 | 0.0 | 0.0 | 24.3 | 16.0 | 104.4 | 4.3 | Clear | Clear conditions throughout the day. | winter |
| 7 | 381 | 3183 | 40.716247 | -74.033459 | 3205 | 40.716540 | -74.049638 | 31754 | Subscriber | female | 1.324928 | January | weekday | 12 | 50 | 1989 | 29 | working_day | -3.3 | -10.9 | -7.5 | -13.0 | 0.0 | -15.7 | 52.2 | 0.0 | 0.0 | 24.3 | 16.0 | 104.4 | 4.3 | Clear | Clear conditions throughout the day. | winter |
| 8 | 318 | 3183 | 40.716247 | -74.033459 | 3275 | 40.718355 | -74.038914 | 31816 | Subscriber | male | 0.507020 | January | weekday | 13 | 55 | 1960 | 58 | working_day | -3.3 | -10.9 | -7.5 | -13.0 | 0.0 | -15.7 | 52.2 | 0.0 | 0.0 | 24.3 | 16.0 | 104.4 | 4.3 | Clear | Clear conditions throughout the day. | winter |
| 9 | 1852 | 3183 | 40.716247 | -74.033459 | 3281 | 40.745910 | -74.057271 | 31754 | Subscriber | male | 2.826499 | January | weekday | 16 | 55 | 1976 | 42 | working_day | -3.3 | -10.9 | -7.5 | -13.0 | 0.0 | -15.7 | 52.2 | 0.0 | 0.0 | 24.3 | 16.0 | 104.4 | 4.3 | Clear | Clear conditions throughout the day. | winter |
| tripduration | start_station_id | start_lat | start_lon | end_station_id | end_lat | end_lon | bikeid | usertype | gender | dist | month | day | hour | min | birthyear | years_old | holiday | tempmax | tempmin | temp | feelslike | precip | dew | humidity | snow | snowdepth | windspeed | visibility | solarradiation | cloudcover | conditions | description | seasons | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 353025 | 667 | 3694 | 40.71113 | -74.0789 | 3195 | 40.730743 | -74.063784 | 29299 | Subscriber | male | 1.934962 | December | weekday | 11 | 39 | 1994 | 24 | working_day | 5.7 | -3.3 | 1.1 | -2.2 | 0.000 | -7.7 | 52.5 | 0.0 | 0.0 | 21.3 | 16.0 | 97.0 | 0.9 | Clear | Clear conditions throughout the day. | winter |
| 353026 | 687 | 3694 | 40.71113 | -74.0789 | 3195 | 40.730743 | -74.063784 | 29258 | Subscriber | male | 1.934962 | December | weekday | 13 | 28 | 1994 | 24 | working_day | 8.3 | 3.3 | 5.8 | 3.8 | 5.119 | 1.5 | 75.1 | 0.0 | 0.0 | 21.3 | 11.9 | 46.4 | 43.4 | cloudy_rain | Becoming cloudy in the afternoon with rain. | winter |
| 353027 | 625 | 3694 | 40.71113 | -74.0789 | 3195 | 40.730743 | -74.063784 | 29664 | Subscriber | male | 1.934962 | December | weekday | 12 | 5 | 1994 | 24 | working_day | 16.2 | 8.3 | 13.8 | 13.7 | 41.371 | 12.8 | 94.1 | 0.0 | 0.0 | 37.5 | 7.5 | 15.2 | 93.4 | cloudy_rain | Cloudy skies throughout the day with a chance of rain throughout the day. | winter |
| 353028 | 826 | 3694 | 40.71113 | -74.0789 | 3186 | 40.719586 | -74.043117 | 26306 | Subscriber | male | 2.657139 | December | weekday | 16 | 34 | 1991 | 27 | working_day | 16.2 | 8.3 | 13.8 | 13.7 | 41.371 | 12.8 | 94.1 | 0.0 | 0.0 | 37.5 | 7.5 | 15.2 | 93.4 | cloudy_rain | Cloudy skies throughout the day with a chance of rain throughout the day. | winter |
| 353029 | 640 | 3694 | 40.71113 | -74.0789 | 3195 | 40.730743 | -74.063784 | 26298 | Subscriber | male | 1.934962 | December | weekend | 10 | 27 | 1994 | 24 | working_day | 11.6 | 5.0 | 7.7 | 4.2 | 0.588 | 1.8 | 67.4 | 0.0 | 0.0 | 45.4 | 15.3 | 47.8 | 90.7 | cloudy_rain | Cloudy skies throughout the day with early morning rain. | winter |
| 353030 | 1081 | 3694 | 40.71113 | -74.0789 | 3269 | 40.726012 | -74.050389 | 29586 | Subscriber | male | 1.872980 | December | weekend | 11 | 51 | 1993 | 25 | working_day | 11.6 | 5.0 | 7.7 | 4.2 | 0.588 | 1.8 | 67.4 | 0.0 | 0.0 | 45.4 | 15.3 | 47.8 | 90.7 | cloudy_rain | Cloudy skies throughout the day with early morning rain. | winter |
| 353031 | 344 | 3694 | 40.71113 | -74.0789 | 3280 | 40.719282 | -74.071262 | 26241 | Subscriber | female | 0.828647 | December | weekday | 21 | 40 | 1983 | 35 | holiday | 4.4 | 1.2 | 2.4 | -2.0 | 0.000 | -6.4 | 52.8 | 0.0 | 0.0 | 24.9 | 16.0 | 86.5 | 33.7 | cloudy_rain | Partly cloudy throughout the day. | winter |
| 353032 | 1233 | 3694 | 40.71113 | -74.0789 | 3186 | 40.719586 | -74.043117 | 29294 | Subscriber | male | 2.657139 | December | weekend | 12 | 55 | 1988 | 30 | working_day | 13.8 | 4.3 | 9.5 | 7.4 | 0.000 | 2.7 | 63.9 | 0.0 | 0.0 | 39.2 | 15.8 | 94.2 | 64.6 | cloudy_rain | Partly cloudy throughout the day. | winter |
| 353033 | 1057 | 3694 | 40.71113 | -74.0789 | 3213 | 40.718489 | -74.047727 | 29475 | Subscriber | female | 2.315132 | December | weekend | 15 | 32 | 1991 | 27 | working_day | 3.9 | 1.1 | 2.7 | 0.0 | 0.000 | -3.1 | 66.0 | 0.0 | 0.0 | 20.8 | 15.5 | 35.6 | 73.2 | cloudy_rain | Partly cloudy throughout the day. | winter |
| 353034 | 301 | 3694 | 40.71113 | -74.0789 | 3277 | 40.714358 | -74.066611 | 26270 | Subscriber | male | 0.902881 | December | weekday | 16 | 34 | 1991 | 27 | working_day | 7.8 | 2.1 | 5.5 | 3.5 | 21.756 | 1.9 | 78.2 | 0.0 | 0.0 | 21.7 | 12.4 | 34.8 | 74.0 | cloudy_rain | Partly cloudy throughout the day with rain. | winter |